Abstract

This paper studies multi-user communication systems with two groups of users: leaders, which possess
system information, and followers, which have no system information, using the formulation of Stackelberg
games. In such games, the leaders play and choose their actions based on their information about
the system, and the followers choose their actions myopically according to their observations of the
aggregate impact of other users. However, obtaining the exact value of these parameters is not practical
in communication systems. To study the effect of uncertainty and preserve the players' utilities in these
conditions, we introduce a robust equilibrium for Stackelberg games. In this framework, the leaders'
information and the followers' observations are uncertain parameters, and the leaders and the followers
choose their actions by solving the worst-case robust optimizations. We show that the followers' uncertain
parameters always increase the leaders' utilities and decrease the followers' utilities. Conversely, the
leaders' uncertain information reduces the leaders' utilities and increases the followers' utilities. We
illustrate our theoretical results with the numerical results obtained based on the power control games in
the interference channels.

Index Terms

Robust game theory, resource allocation, Stackelberg game, worst-case robust optimization.

I. Introduction 

Stackelberg games provide a general framework to analyze and design hierarchical interactions among 
rational, self-interested decision makers (players) [1], [2]. These hierarchical non-cooperative games
consist of two groups of players: leaders, which have complete information about the other players, 
and followers, which have no system information. First, each leader selects its action by solving a 
bi-level optimization problem which seeks to maximize the leader's utility subject to the followers' 
actions as estimated by the leader's information. The followers then select their actions according to their 
observations from the aggregate impact of other users. 
Recently, the authors in [3]-[5] have formalized the power control problem in interference channels
as Stackelberg games. In these papers, the utility of each user is its throughput and its action is its 
transmission power. The leaders' information contains the direct and interference channel gains of all the 
users, and the impacts of users on each other are the amount of interference at their receivers. In this 
system model, the leaders first determine their transmit powers. Next, the followers' receivers measure 
(observe) the amount of interference and feed back this value to their transmitters. Finally, the followers' 
transmitters determine their transmission powers based on this estimated value of the interference. The 
information possessed by the leaders results in an increase in the leaders' throughput as well as the 
followers' throughput in some cases (see e.g., 0). 

However, extracting accurate system information either by the leaders or the followers is a key
practical issue when implementing Stackelberg games in communication systems because obtaining accurate
information is costly and difficult. For example, in wireless communication systems, fading, channel
noise, delays in feedback channels, and users' mobility cause uncertainty in the players' parameters.
Consequently, not knowing the accurate values of these parameters may lead to worse utilities for both 
the leaders and the followers. Therefore, it is essential to consider these uncertain parameters and apply 
the robust approach to guarantee the utilities of both leaders and followers. 

Robust optimization theory is a branch of applied mathematics that mitigates the impact of uncertain
parameters on the solution of optimization problems. In this context, the uncertain parameter is first
modeled as a nominal (estimated) value plus an additive error (the uncertain part) [6]. In the next step,
the optimization problem with nominal values (referred to as the nominal optimization problem) is mapped
to another optimization problem (called the robust counterpart) in which the uncertain parameter is a
new optimization variable [7], [8].

Generally, two basic approaches are applied to define this mapping [6], [7], [9]: Bayesian approaches,
where the statistics of the error are considered and the utility is statistically guaranteed, and worst-case
approaches, where the error is assumed to be bounded within a specific region (the uncertainty region) and
the utility is guaranteed for any realization of the error within this region. Both approaches have been
applied in communications, economics, and mathematics to tackle uncertain parameters in Stackelberg
games [10]-[14]. In this paper, we choose a worst-case approach to guarantee the players' utilities under
uncertainty within an uncertainty region. Based on the above terminology, we refer to the Stackelberg game
and its equilibrium with nominal parameters as the nominal Stackelberg game (NSG) and the nominal
Stackelberg equilibrium (NSE), respectively. Moreover, we call the Stackelberg game and its equilibrium
where uncertainty exists and robust optimization is applied the robust Stackelberg game (RSG) and
the robust Stackelberg equilibrium (RSE), respectively.

The work most closely related to this paper is [14], which considers a one-leader one-follower Stackelberg
game and assumes that the leader does not know the exact values of some parameters needed to estimate
the follower's action. By minimizing the second-order sensitivity function of the leader's utility with
respect to the uncertain parameters, the paper determines the worst-case utility for the leader given its
imperfect information. However, to implement the RSG in communication systems, we encounter different
challenges: 1) What is the definition of the RSE if the leaders and followers have different uncertain
parameters? 2) What is the performance of the system at the RSG compared to that of the NSG? 3) How
can we overcome the additional computational complexities involved when the leader needs to solve
the bi-level robust optimization problem? 4) How can we generalize the RSG to multiple-leaders and
multiple-followers communication scenarios?

To answer the above questions, we first distinguish between the players' uncertain parameters. As 
stated before, in the Stackelberg game, the actions of the leaders and the followers are determined by 
the leaders' information and the followers' observations about the aggregate impact of other players, 
respectively. Consequently, we assume that the uncertain parameters include the leaders' information 
and the followers' observations. Then, we determine two cases for the RSE: in case 1, the followers' 
observations are noisy while the leaders possess complete and accurate information sets, and in case 2,
the leaders' information sets are uncertain in addition to the followers' noisy observations. 

To evaluate the performance of the RSG compared to that of the NSG, we define two criteria: i) the 
difference between the players' strategies at the RSE and NSE and ii) the difference between the players' 
utilities at these two equilibria. Our results show that for case 1, the uncertain parameters increase the 
leaders' utilities and decrease the followers' utilities. The leaders' strategies are increasing functions and 
the followers' strategies are decreasing functions with respect to the uncertainty region. In contrast, for 
case 2, the leaders' uncertain information decreases the leaders' utilities and increases the followers'
utilities. The leaders' strategies are decreasing functions and the followers' strategies are increasing 
functions with respect to the uncertainty region. For both of these two cases, we derive the conditions 
(in terms of system parameters and interactions among users) under which the social utility at the RSE 
increases as compared to that at the NSE. 

In this paper, we derive the relation between the players' strategies at the RSE and at the NSE based on
system parameters and bounds of the uncertainty region. Based on this, the complexity associated with the
leaders solving robust bi-level optimization problems is reduced considerably. We initiate the analysis of
the RSE for the one-leader one-follower communication scenario, and generalize it to a multiple-leaders
and multiple-followers scenario.

The rest of this paper is organized as follows. In Section II, the system model and game formulation 
are presented, followed by the definition of the uncertain parameters in Section III. In Section IV, two 
cases for the RSE are introduced and analyzed for the one-leader one-follower scenario. In Section V, 
we apply our theoretical findings to power control games and provide numerical results for the RSE. 
In Section VI, we extend our framework for the RSG to the multiple-leaders and multiple-followers 
scenario, followed by our conclusions in Section VII. 

II. System setup 

A. Network model 

Consider a set of communication resources divided into $K$ orthogonal dimensions, e.g., frequency
bands, time slots, and routes, which are shared between a set of $N$ users. Each user consists of both
a transmitter and a receiver. The set of possible positive transmission actions of the $n$th user over all the
dimensions is given by

$$\mathcal{A}_n = \{ a_n = (a_n^1, \cdots, a_n^K) \mid a_n^k \in [a_n^{k,\min}, a_n^{k,\max}] \}, \quad \forall n \in \mathcal{N}, \ \forall k \in \mathcal{K},$$

where $\mathcal{K} = [1, \cdots, K]$ and $\mathcal{N} = [1, \cdots, N]$. As an example of such a communication system, consider
the transmission in the interference channels over $K$ subchannels as depicted in Fig. 1. We denote the
achieved utility of user $n$ by $v_n(a)$, where $a = [a_n, a_{-n}]$ and $a_{-n} = (a_1, \ldots, a_{n-1}, a_{n+1}, \ldots, a_N)$ is the
vector of the actions of all users except user $n$.

We assume that: A1) the utility function of each user is increasing, twice differentiable, and concave
with respect to $a_n$; A2) the utility function of user $n$ can be expressed as

$$v_n(a_n, f_n(a_{-n}, x_n)) = \sum_{k=1}^{K} v_n^k(a_n^k, f_n^k(a_{-n}, x_n)),$$ (1)

where $f_n(a_{-n}, x_n) = [f_n^1(a_{-n}, x_n), \cdots, f_n^K(a_{-n}, x_n)]$ is the $1 \times K$ vector of the linear aggregate impact
of other users on user $n$, given by

$$f_n(a_{-n}, x_n) = \sum_{m \neq n} a_m x_{nm} + y_n,$$ (2)

where the product $a_m x_{nm}$ is taken per dimension, $x_n = [x_{n1}, \cdots, x_{n(n-1)}, x_{n(n+1)}, \cdots, x_{nN}, y_n]$ collects
the system parameters of user $n$, $x_{nm}$ is a $1 \times K$ vector whose element $x_{nm}^k$ represents the system
parameter between user $m$ and user $n$ in dimension $k$, and $y_n = [y_n^1, \cdots, y_n^K]$, where $y_n^k$ denotes the
impact of the system on user $n$ in dimension $k$. For example, in Fig. 1, $f_n^k(a_{-n}, x_n)$ is the interference of
the other users on user $n$ in subchannel $k$, i.e., $f_n^k(a_{-n}, x_n) = \sum_{m \neq n} H_{nm}^k a_m^k + \sigma_n^k$, where $H_{nm}^k$ is
the channel gain between user $m$ and user $n$ in subchannel $k$, and $a_n^k$ is the transmitted power of user $n$
in subchannel $k$. For this example, the interference channel gains between the users and the noise in each
subchannel are the system parameters, i.e., $x_{nm} = H_{nm}$, where $H_{nm} = [H_{nm}^1, \cdots, H_{nm}^K]$, and
$y_n^k = \sigma_n^k$. A3) The utility of user $n$ is a decreasing function of $f_n^k(a_{-n}, x_n)$;
A4) the utility function of each user is twice differentiable over $a_n$ and $f_n(a_{-n}, x_n)$.

Note that A1 is a common assumption for the utility function of all users in communication
systems [15], [16]. In multiuser communication, A2 and A3 are well known when the users share the
same resources and have a negative impact on each other [17]. A4 indicates the
differentiability of the utility of each user with respect to $a_n$ and $f_n(a_{-n}, x_n)$. Therefore, all of the above
assumptions are practical conditions for communication systems.

This setup includes different game theoretic formulations of communication systems such as additively
coupled sum constrained games [17], which can model many communication systems, for example, i)
cellular transmission within a given cell, and ii) ad hoc wireless network transmission. In this paper, we
choose power control games as an illustrative example, where the throughput of each user in the system
is its utility, i.e., $v_n(a_n, f_n(a_{-n}, x_n)) = \sum_{k=1}^{K} \log\left(1 + \frac{H_{nn}^k a_n^k}{f_n^k(a_{-n}, x_n)}\right)$.
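The aggregate impact in (2) and the throughput utility of the power control example can be evaluated in a few lines. The following is a minimal sketch, assuming the convention that `H[n, m, k]` is the gain from transmitter `m` to receiver `n` in subchannel `k`; the function names and all numeric values are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def interference(a, H, sigma, n):
    """Aggregate impact f_n^k = sum_{m != n} H[n, m, k] * a[m, k] + sigma[n, k]."""
    others = [m for m in range(a.shape[0]) if m != n]
    return sum(H[n, m] * a[m] for m in others) + sigma[n]

def throughput(a, H, sigma, n):
    """Utility v_n = sum_k log(1 + H[n, n, k] * a[n, k] / f_n^k)."""
    f = interference(a, H, sigma, n)
    return float(np.sum(np.log(1.0 + H[n, n] * a[n] / f)))

# Hypothetical two-user, two-subchannel system (values chosen for illustration).
H = np.array([[[1.0, 0.8], [0.2, 0.1]],
              [[0.3, 0.2], [0.9, 1.1]]])   # H[n, m, k]
sigma = np.full((2, 2), 0.1)                 # noise power per user and subchannel
a = np.array([[1.0, 0.5], [0.7, 0.9]])       # a[n, k]: transmit powers

f0 = interference(a, H, sigma, 0)            # interference seen by user 0
v0 = throughput(a, H, sigma, 0)              # throughput of user 0
```

Raising the power of user 1 increases `f0` and lowers `v0`, which is the behavior required by A3.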

The information set obtained by user $n$ is denoted by $\mathcal{I}_n$, and it may be empty or contain all the
private information of other users in the system such as their utilities and their system parameters. Also,
the information set of each user may be different from the others. The users with empty and non-empty
information sets are referred to as uninformed and informed users, respectively.

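Clearly, $\partial v_n^k(a_n^k, f_n^k(a_{-n}, x_n)) / \partial a_n^k$ is the rate of change of the utility of user $n$ with respect to
its action, which has a positive value based on A1. A larger value of $\partial v_n^k(a_n^k, f_n^k(a_{-n}, x_n)) / \partial a_n^k$ means
a larger rate of increase of the $n$th user's utility with respect to its action. For example, in power
control games, we have

$$\frac{\partial v_n^k(a_n^k, f_n^k(a_{-n}, x_n))}{\partial a_n^k} = \frac{H_{nn}^k}{f_n^k + H_{nn}^k a_n^k}, \quad \forall n \in \mathcal{N}, \ \forall k \in \mathcal{K}.$$ (3)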

When the user has a large direct channel gain, i.e., $H_{nn}^k \gg 1$, or a small interference, i.e., $f_n^k \ll 1$, this
value is large. This means that a small change in the user's action causes a large change in the user's utility.
We consider the column gradient vector of $v_n$ for user $n$, represented by $J_{a_n}^n = \nabla_{a_n} v_n(a_n, f_n(a_{-n}, x_n))$,
as the internal rate of user $n$. Also, let us define

$$C_{nm} = X_{nm} J_{f_n}^n,$$ (4)

where $J_{f_n}^n = \nabla_{f_n} v_n(a_n, f_n(a_{-n}, x_n))$ and $X_{nm} = \mathrm{diag}\{(x_{nm}^k)_{k=1}^{K}\}$. For example, in power control games,

$$C_{nm}^k = -\frac{H_{nn}^k H_{nm}^k a_n^k}{f_n^k (f_n^k + H_{nn}^k a_n^k)}, \quad \forall n \in \mathcal{N}, \ \forall k \in \mathcal{K},$$ (5)

where $C_{nm}^k$ is the $k$th element of $C_{nm}$. Clearly, for a given interference level, larger values of $H_{nm}^k$ lead
to larger magnitudes of $C_{nm}^k$, i.e., a larger impact of user $m$ on user $n$. This example shows that $C_{nm}$ denotes
the rate of decrease in the utility of user $n$, which is linearly related to a corresponding increase in the action
of user $m$. We call $C_{nm}$ the negative impact of user $m$ on user $n$. In the following sections, we use $C_{nm}$ and
$J_{a_n}^n$ to study the effect of robustness on the social utility at the RSE compared to the NSE.
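As a sanity check of the power control expressions (3) and (5), the sketch below compares them with central finite differences of the throughput for a single subchannel and a single interferer. The helper names and all numeric values are illustrative assumptions.

```python
import numpy as np

def direct_rate(H_nn, a_n, f_n):
    """Eq. (3): dv/da_n = H_nn / (f_n + H_nn * a_n)."""
    return H_nn / (f_n + H_nn * a_n)

def negative_impact(H_nn, H_nm, a_n, f_n):
    """Eq. (5): C_nm = -H_nn * H_nm * a_n / (f_n * (f_n + H_nn * a_n))."""
    return -H_nn * H_nm * a_n / (f_n * (f_n + H_nn * a_n))

def v(H_nn, H_nm, a_n, a_m, sigma):
    """Single-subchannel throughput with interference f_n = H_nm * a_m + sigma."""
    f_n = H_nm * a_m + sigma
    return np.log(1.0 + H_nn * a_n / f_n)

# Hypothetical operating point.
H_nn, H_nm, a_n, a_m, sigma = 1.0, 0.3, 0.8, 0.6, 0.1
f_n = H_nm * a_m + sigma
h = 1e-6

# Central differences with respect to the user's own action and the other user's action.
fd_own = (v(H_nn, H_nm, a_n + h, a_m, sigma) - v(H_nn, H_nm, a_n - h, a_m, sigma)) / (2 * h)
fd_cross = (v(H_nn, H_nm, a_n, a_m + h, sigma) - v(H_nn, H_nm, a_n, a_m - h, sigma)) / (2 * h)

J = direct_rate(H_nn, a_n, f_n)
C = negative_impact(H_nn, H_nm, a_n, f_n)
```

The cross derivative is taken with respect to the action of user $m$, so by the chain rule it equals the derivative with respect to $f_n^k$ scaled by $H_{nm}^k$, which is how $C_{nm}$ is defined in (4).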

B. Game formulation 

Now we model the interaction between informed and uninformed users in communication systems
as a Stackelberg game. Assume that the leaders and followers of the Stackelberg game are denoted by
$\mathcal{N}_l = \{1, \cdots, N_l\}$ and $\mathcal{N}_f = \{1, \cdots, N_f\}$, respectively, and $\mathcal{N} = \mathcal{N}_l \cup \mathcal{N}_f$ is the set of all players in the
game, where

$$\mathcal{I}_n = \begin{cases} \{(\mathcal{A}_m, v_m, X_{mn}, X_{mm}, X_{nm})_{m \neq n}, \ \forall m \in \mathcal{N}\}, & \text{if } n \in \mathcal{N}_l, \\ \emptyset, & \text{if } n \in \mathcal{N}_f, \end{cases}$$

where $X_{mn} = \mathrm{diag}\{(x_{mn}^k)_{k=1}^{K}\}$ and $X_{mm} = \mathrm{diag}\{(x_{mm}^k)_{k=1}^{K}\}$. In the Stackelberg game, the leaders first
play their strategy. Then, the receiver of follower $n$ measures the value of $f_n(a_{-n}, x_n)$ and sends it to its
corresponding transmitter to decide its action. Since the value of $f_n(a_{-n}, x_n)$ can be considered as the
observation of follower $n$ from the aggregate impact of other users, we refer to it as an observation of
user $n$. Therefore, the followers' optimization problems can be formalized as

$$\max_{a_n \in \mathcal{A}_n} v_n(a_n, f_n(a_{-n}, x_n)), \quad \forall n \in \mathcal{N}_f.$$ (6)

The solution to (6) for user $n$ represents its best response, denoted by $a_n^*(a_{-n})$. Since the followers
are non-cooperative players, the equilibrium point of the game emerging among them is
$a_{\mathcal{N}_f}^* = (a_1^*, \cdots, a_{N_f}^*)$, which satisfies

$$v_n(a_n^*, f_n(a_{-n}^*, x_n)) \geq v_n(a_n, f_n(a_{-n}^*, x_n)), \quad \forall a_n \in \mathcal{A}_n,$$ (7)

where $a_{-n}^* = [a_1^*, \ldots, a_{n-1}^*, a_{n+1}^*, \ldots, a_{N_f}^*]$ for all $n \in \mathcal{N}_f$. If $\mathcal{I}_n = \emptyset$ for all players, the game reduces to a
strategic non-cooperative game.

To define the equilibrium of the Stackelberg game, we consider that there is only one leader in the
Stackelberg game, which has index 0. The Stackelberg game equilibrium prescribes an optimal strategy
for the leader if the followers play their NE. For example, in a one-leader one-follower Stackelberg
game, when player 1 is a follower, an action $a_0^{*NSE}$ is the Stackelberg equilibrium strategy for player 0
if, for all $a_0 \in \mathcal{A}_0$, we have

$$v_0(a_0^{*NSE}, f_0(a_1^*(a_0^{*NSE}), x_0)) \geq v_0(a_0, f_0(a_1^*(a_0), x_0)).$$ (8)

In this case, the leader's optimization problem changes to the following bi-level optimization problem:

$$\max_{a_0 \in \mathcal{A}_0} \; v_0(a_0, f_0(a_1, x_0))$$ (9)
$$\text{subject to:} \quad \max_{a_1 \in \mathcal{A}_1} \; v_1(a_1, f_1(a_0, x_1)).$$

For the multi-follower scenario, let $a_{-0}^*(a_0) = [a_1^*, \cdots, a_{N-1}^*]$ be the NE strategy of the followers if player 0
plays $a_0$. The strategy profile $(a_0^{*NSE}, a_{-0}^{*NSE}(a_0^{*NSE}))$ is the Stackelberg game equilibrium iff

$$v_0(a_0^{*NSE}, f_0(a_{-0}^{*NSE}(a_0^{*NSE}), x_0)) \geq v_0(a_0, f_0(a_{-0}^*(a_0), x_0)), \quad \forall a_0 \in \mathcal{A}_0.$$

The achieved utility of player $n$ at the NSE and the social utility of the game are denoted by $\omega_n^{*NSE}$ and
$\omega^{*NSE} = \sum_{n \in \mathcal{N}} \omega_n^{*NSE}$, respectively.



Note that if the followers' game has multiple Nash equilibria, the definition of the Stackelberg
equilibrium is more complicated, as described in [2], [18]-[20]. In this paper, we restrict our study to the
Stackelberg game with a unique NE in the followers' game. The uniqueness condition for this game is
provided in Section VI. In the following, we define the uncertain parameters in the system and based 
on them, we introduce different types of RSE and the robust counterpart optimization problems for both 
leaders and followers. 
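The bi-level structure of the leader's problem (9) can be sketched numerically for a one-leader one-follower, single-carrier power control game. The snippet below is a minimal illustration and not the authors' method: because the throughput alone is increasing in a user's own power, a hypothetical linear power price `PRICE` is subtracted from each utility so that the follower's best response can be interior. The channel gains, noise levels, and power bounds are likewise made-up values.

```python
import numpy as np

# Hypothetical parameters; H_nm is the gain from transmitter m to receiver n.
H00, H01, H10, H11 = 1.0, 0.4, 0.5, 1.0
s0, s1 = 0.1, 0.1                 # noise powers
A_MIN, A_MAX = 0.01, 5.0          # bounds of the action sets
PRICE = 0.5                       # assumed power price, not part of the paper's model

def follower_best_response(a0):
    """Inner problem: maximize log(1 + H11*a1/f1) - PRICE*a1 over [A_MIN, A_MAX]."""
    f1 = H10 * a0 + s1                      # follower's observed interference
    a1 = 1.0 / PRICE - f1 / H11             # stationary point of the concave objective
    return float(np.clip(a1, A_MIN, A_MAX))

def leader_utility(a0):
    """Outer objective, evaluated at the follower's anticipated best response."""
    a1 = follower_best_response(a0)
    f0 = H01 * a1 + s0
    return float(np.log(1.0 + H00 * a0 / f0) - PRICE * a0)

# Grid search over the leader's action set.
grid = np.linspace(A_MIN, A_MAX, 2000)
utilities = np.array([leader_utility(a0) for a0 in grid])
a0_nse = float(grid[np.argmax(utilities)])
a1_nse = follower_best_response(a0_nse)
```

A grid search is used for the outer problem only because it is easy to verify; any scalar optimizer could replace it once the follower's best response is available in closed form.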

III. Uncertain parameters 

As stated before, both the followers' observations and the leaders' information sets are uncertain
parameters in the considered communication scenario. In the following subsections, we define the
followers' uncertain observations and the leaders' uncertain information sets, which we call noisy observations
and incomplete information sets, respectively.

A. Noisy observation 

Consider the uncertain value of $f_n(a_{-n}, x_n)$ as the noisy observation of user $n$ of the impact of the other
users, which is modeled as the summation of a deterministic nominal value and an error [21], i.e.,

$$\tilde{f}_n(a_{-n}, x_n) = f_n(a_{-n}, x_n) + \hat{f}_n(a_{-n}, x_n),$$ (10)

where $\tilde{f}_n(a_{-n}, x_n) = [\tilde{f}_n^1(a_{-n}, x_n), \cdots, \tilde{f}_n^K(a_{-n}, x_n)]$, $f_n(a_{-n}, x_n) = [f_n^1(a_{-n}, x_n), \cdots, f_n^K(a_{-n}, x_n)]$, and
$\hat{f}_n(a_{-n}, x_n) = [\hat{f}_n^1(a_{-n}, x_n), \cdots, \hat{f}_n^K(a_{-n}, x_n)]$ are the actual value, the nominal value, and the error of the
observation of user $n$, respectively. In worst-case robust optimization theory, the error of the noisy observation is
assumed to be bounded in a closed region named the uncertainty region [21], [22]:

$$\Re_n(a_{-n}) = \{ \tilde{f}_n(a_{-n}, x_n) \in \Re_n(a_{-n}) \mid \|\hat{f}_n(a_{-n}, x_n)\|_2 \leq \varepsilon_n \}, \quad \forall n \in \mathcal{N}_f,$$ (11)

where $\varepsilon_n$ is the bound of the uncertainty region, and $\|\cdot\|_2$ is the ellipsoid norm. The noisy observation is
considered as a new optimization parameter in the utility of each user [21]. The new utility function of
user $n$, considering the uncertainty, is $u_n(a_n, \tilde{f}_n)$, which satisfies

$$u_n(a_n, \tilde{f}_n(a_{-n}, x_n))\big|_{\varepsilon_n = 0} = v_n(a_n, f_n(a_{-n}, x_n)).$$ (12)

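Consequently, the followers' optimization problem changes to

$$\max_{a_n \in \mathcal{A}_n} \ \min_{\tilde{f}_n(a_{-n}, x_n) \in \Re_n(a_{-n})} u_n(a_n, \tilde{f}_n(a_{-n}, x_n)), \quad \forall n \in \mathcal{N}_f.$$ (13)

The robust Nash equilibrium (RNE) of this game, assuming $\mathcal{I}_n = \emptyset$ for all users, is defined [21],
[22] as $\tilde{a}^* = (\tilde{a}_1^*, \cdots, \tilde{a}_N^*)$ iff

$$\min_{\tilde{f}_n(\tilde{a}_{-n}^*, x_n) \in \Re_n(\tilde{a}_{-n}^*)} u_n(\tilde{a}_n^*, \tilde{f}_n(\tilde{a}_{-n}^*, x_n)) \geq \min_{\tilde{f}_n(\tilde{a}_{-n}^*, x_n) \in \Re_n(\tilde{a}_{-n}^*)} u_n(a_n, \tilde{f}_n(\tilde{a}_{-n}^*, x_n)), \quad \forall a_n \in \mathcal{A}_n.$$ (14)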
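The inner minimization in (13) can be brute-forced for a small example to see what the worst-case observation looks like. The sketch below assumes $K = 2$, the throughput utility of the power control game, and made-up values for the received signal powers `g` (standing for $H_{nn}^k a_n^k$), the nominal interference, and the bound `eps`. Only the boundary of the uncertainty region is sampled, since this utility is strictly decreasing in every interference component.

```python
import numpy as np

def utility(g, f):
    """u = sum_k log(1 + g_k / f_k), with g_k standing for H_nn^k * a_n^k."""
    return float(np.sum(np.log(1.0 + g / f)))

def worst_case_observation(g, f_nominal, eps, samples=3600):
    """Brute-force the inner minimization of (13) for K = 2 over the circle of radius eps."""
    angles = np.linspace(0.0, 2.0 * np.pi, samples, endpoint=False)
    errors = eps * np.stack([np.cos(angles), np.sin(angles)], axis=1)
    values = np.array([utility(g, f_nominal + e) for e in errors])
    worst = int(np.argmin(values))
    return f_nominal + errors[worst], float(values[worst])

# Hypothetical follower with two subchannels.
g = np.array([1.0, 0.8])
f_nominal = np.array([0.3, 0.5])
f_worst, u_worst = worst_case_observation(g, f_nominal, eps=0.05)
u_nominal = utility(g, f_nominal)
```

In this example the worst case inflates the interference in both subchannels, so the follower plans against a more hostile environment than the nominal one.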

B. Incomplete information set 

We refer to the leaders' information set with uncertain parameters as an incomplete information set,
where $X_{n_f n_l}$ is the uncertain parameter. Note that obtaining this parameter is more challenging than
obtaining other parameters in communication systems. For example, in power control games, the
follower's transmitter sends pilot signals to its receiver to estimate its direct channel gains. The leader
can extract $H_{n_f n_f}$ by listening to the follower's feedback channel. Also, the leader can estimate $H_{n_l n_f}$
by listening to this pilot signal. Since there is no pilot signal from the follower's receiver, the leader
cannot estimate $H_{n_f n_l}$. Following the worst-case approach, the uncertain information is considered as the
summation of its nominal value and the uncertain part, which is bounded in the uncertainty region as

$$\Re_{X_{n_f n_l}} = \{ \tilde{X}_{n_f n_l} \mid \|\hat{X}_{n_f n_l}\|_2 = \|\tilde{X}_{n_f n_l} - X_{n_f n_l}\|_2 \leq \delta_{n_f n_l} \}, \quad \forall n_f \in \mathcal{N}_f, \ \forall n_l \in \mathcal{N}_l.$$ (15)

In contrast to the incomplete information set, we refer to the leader's information set without uncertainty
as the complete information set.
IV. Robust Stackelberg Equilibrium

Using the above definitions of uncertain parameters, we can define different forms of RSG. In this
paper, we focus on two common scenarios:

Case 1: the leaders possess complete information sets, while the followers possess noisy observations;

Case 2: the leaders possess incomplete information sets and the followers possess noisy observations.

As an initial point to analyze the RSE, we first study the one-leader one-follower scenario. Then the
extension of the RSE to multiple followers and multiple leaders is provided in Section VI.

A. Analysis of RSE for case 1 

For this case, the leader's information set is denoted by $\mathcal{I}_0^{RSE1}$, where

$$\mathcal{I}_0^{RSE1} = \{(\mathcal{A}_m, v_m, X_{mn}, X_{mm}, X_{nm}, \Re_m(a_{-m}))_{m \neq n}, \ \forall m \in \mathcal{N}\}.$$ (16)

Also, the leader knows that the follower's optimization problem is the same as (13). Therefore, the leader's
bi-level optimization problem is

$$\max_{a_0 \in \mathcal{A}_0} \; v_0(a_0, f_0(a_1, x_0))$$ (17)
$$\text{subject to:} \quad \max_{a_1 \in \mathcal{A}_1} \ \min_{\tilde{f}_1 \in \Re_1(a_{-1})} u_1(a_1, \tilde{f}_1(a_0, x_1)).$$

If $\tilde{a}_1^*(a_0)$ is the best response of (13) to the leader's action, the RSE for case 1 is $a_0^{*RSE1}$, defined as

$$v_0(a_0^{*RSE1}, f_0(\tilde{a}_1^*(a_0^{*RSE1}), x_0)) \geq v_0(a_0, f_0(\tilde{a}_1^*(a_0), x_0)), \quad \forall a_0 \in \mathcal{A}_0.$$ (18)

Remark 1: The RSE of case 1 exists since: 1) (13) is concave with respect to $a_1(a_0)$ for any fixed
action of the leader, and a decreasing function with respect to $f_1(a_0, x_1)$, 2) $\mathcal{A}_1$ and $\Re_1(a_{-1})$ are two
convex, bounded, and disjoint sets. Consequently, there always exists a saddle point of (13) [23], which
is a solution to (13).

Now we characterize and compute the RSE, which is the most difficult part due to the computational
complexity of (17). In the following, for notational convenience, we omit the arguments of $f_1(a_0, x_1)$ and
$f_0(a_1, x_0)$.

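Lemma 1: The uncertain observation for the follower's optimization problem is equal to

$$\tilde{f}_1^* = f_1 - \varepsilon_1 \vartheta_1,$$ (19)

where $\tilde{f}_1^* = [\tilde{f}_1^{*1}, \cdots, \tilde{f}_1^{*K}]$, $\vartheta_1 = [\vartheta_1^1, \cdots, \vartheta_1^K]$, and $\vartheta_1^k$ is defined as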

»9$ = , 9ft — , (20) 






Proof: See Appendix A. ■ 
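One way to see why the worst-case observation takes the form (19) is a first-order argument. The following is only a sketch, assuming that $\varepsilon_1$ is small enough for a linear approximation of $u_1$ in the observation to be accurate. The normalized-gradient direction that appears at the end is a consequence of this approximation and is not a restatement of (20) or of the proof in Appendix A.

```latex
\min_{\|\hat{f}_1\|_2 \le \varepsilon_1} u_1(a_1, f_1 + \hat{f}_1)
  \;\approx\; u_1(a_1, f_1)
  + \min_{\|\hat{f}_1\|_2 \le \varepsilon_1} \hat{f}_1 \, \nabla_{f_1} u_1(a_1, f_1)
  \;=\; u_1(a_1, f_1) - \varepsilon_1 \, \|\nabla_{f_1} u_1(a_1, f_1)\|_2 ,
\qquad
\hat{f}_1^{*} = -\,\varepsilon_1 \,
  \frac{\left(\nabla_{f_1} u_1(a_1, f_1)\right)^{T}}{\|\nabla_{f_1} u_1(a_1, f_1)\|_2} .
```

The second step is the Cauchy-Schwarz inequality. Since the utility is decreasing in the observation by A3, every entry of the gradient is negative, so every entry of $\hat{f}_1^{*}$ is positive: under this approximation the follower's worst case is an inflated aggregate impact, shifted from the nominal value by an amount proportional to $\varepsilon_1$.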
Using (19) in problem (17) removes the uncertainty region from the leader's bi-level optimization
problem, which simplifies to

$$\max_{a_0 \in \mathcal{A}_0} \; v_0(a_0, f_0)$$ (21)
$$\text{subject to:} \quad \max_{a_1 \in \mathcal{A}_1} \; v_1(a_1, \tilde{f}_1^*).$$

By this reformulation, we can derive the difference between the strategies of the leader and the follower at the
RSE for case 1 and at the NSE.

Proposition 1: For case 1 of the RSG, the leader's action is an increasing function and the follower's
action is a decreasing function with respect to $\varepsilon_1$, and they can be calculated as



t *RSEl _ *NSE 
l l — a l 



eix((Ji iai )^J aifl x^) T , (22) 



„*RSE1 _ „*JVSB , . v ffrO \-l T y /yl \-1t1 v ,qTnT /^x 
a - a + £ 1 X ((Ja ,aoi Ja ,f A 01 (Ja iai ) JaA X #1 ) ■ 

where $J_{a_n f_n}^n = \nabla_{f_n a_n} u_n(a_n, f_n)$ and $J_{a_n a_n}^n = \nabla_{a_n}^2 v_n(a_n, f_n)$.

Proof: See Appendix B. ■ 

From Proposition 1, the solution to the robust problem (17) can be calculated based on the NSE and
the bound of the uncertainty region in (22) and (23). Therefore, the computational complexity of solving (17) is
reduced. From (21) and Proposition 1, the variations of the utilities of the leader and the follower are derived.

Proposition 2: For case 1 of the RSG: 1) for any realization of the follower's noisy observation, we
have

$$\omega_0^{*NSE} \leq \omega_0^{*RSE1}, \qquad \omega_1^{*RSE1} \leq \omega_1^{*NSE},$$

where $\omega_n^{*RSE1}$ is the achieved utility of player $n$ at the RSE for case 1; 2) the social utility of the game increases
compared to the NSG, i.e., $\omega^{*RSE1} > \omega^{*NSE}$, if we have

$$\text{C1}: |C_{10}| < |J_{a_0}^0|, \qquad \text{C2}: |J_{a_1}^1| < |C_{01}|,$$

where $|q|$ is the absolute value of the elements of $q$.

Proof: See Appendix C. ■

From Proposition 2, uncertainty in the follower's observation increases the leader's utility. In contrast,
the follower achieves a smaller utility at the RSE for case 1 compared to the NSE. Interestingly, the social utility
at the RSE increases compared to the NSE if C1 and C2 hold. C1 and C2 can be interpreted as follows: the
negative impact of the leader on the follower is less than the leader's direct rate, and the follower's direct
rate is less than its negative impact on the leader. Correspondingly, the increase in the leader's utility is
larger than the decrease of the follower's utility when the leader's strategy increases and the follower's
strategy decreases and, as a result, the social utility increases.

B. Analysis of RSE for case 2 

For case 2, we denote the leader's incomplete information set as $\tilde{\mathcal{I}}_0$, which is equivalent to

$$\tilde{\mathcal{I}}_0 = \{(\mathcal{A}_m, v_m, X_{mn}, X_{mm}, X_{nm}, \Re_m(a_{-m}))_{m \neq n}, \ \forall m \in \mathcal{N}\},$$ (24)

where $X_{10}$ is the uncertain parameter, with the uncertainty region given by (15). Using the concept of worst-case
optimization, the leader's bi-level optimization problem is changed to

$$\max_{a_0 \in \mathcal{A}_0} \ \min_{\tilde{X}_{10} \in \Re_{X_{10}}} v_0(a_0, f_0)$$ (25)
$$\text{subject to:} \quad \max_{a_1 \in \mathcal{A}_1} \ \min_{\tilde{f}_1 \in \Re_1} u_1(a_1, \tilde{f}_1).$$

According to (25), the leader cannot precisely evaluate its impact on the follower. Note that 1) $f_1$
is a linear function of $X_{10}$, and 2) the leader considers the worst-case condition of its information in the
uncertainty region to obtain the solution of the follower. Therefore:

A5: for case 2, the negative impact of the leader on the follower is a decreasing function with respect
to $\delta_{10}$, i.e., $\nabla_{\delta_{10}} f_1 < 0$.

Remark 2: The RSE in case 2 always exists, because: 1) $\Re_{X_{10}}$, $\Re_1$, $\mathcal{A}_0$, and $\mathcal{A}_1$ are compact and
closed sets, 2) for any realization of $\tilde{X}_{10} \in \Re_{X_{10}}$, is closed and convex. Consequently, for any value
of the leader's uncertain information and strategy, the follower has a feasible strategy.

While the existence of the RSE for case 2 can be shown easily, solving problem (25) is
significantly more complex than solving (17). In the following, we discuss the relationship
between the RSE for case 2, the RSE for case 1, and the NSE.

Proposition 3: The leader's utility at the RSE in case 2 is always less than the leader's utility in case 1,
namely, $\omega_0^{*RSE2} < \omega_0^{*RSE1}$, where $\omega_n^{*RSE2}$ is the achieved utility of the $n$th user at the RSE of case 2.

Proof: See Appendix D. ■ 

Proposition 3 shows that the leader's incomplete information set always decreases the leader's utility 
compared to case 1. Next we compare the leader's and follower's utility in the RSE for case 2 to that of 
the NSE. 

Proposition 4: For case 2 of the RSG: 1) the leader's strategy is a decreasing function with respect to
$\delta_{10}$ and the follower's strategy is an increasing function with respect to $\delta_{10}$; 2) for all realizations of the
leader's incomplete information set, we have

$$\omega_0^{*RSE2} < \omega_0^{*NSE}, \qquad \omega_1^{*NSE} < \omega_1^{*RSE2};$$

3) the social utility of the game increases compared to the NSG, i.e., $\omega^{*RSE2} > \omega^{*NSE}$, if

$$\text{C3}: |J_{a_0}^0| < |C_{10}|, \qquad \text{C4}: |J_{a_1}^1| > |C_{01}|.$$

Proof: See Appendix E. ■



According to Proposition 4, uncertainty in the leader's information set always decreases the leader's
utility compared to the NSE. In contrast, the follower reaches a higher utility compared to the NSE. In this
case, if the leader's direct rate is less than its negative impact on the follower (i.e., C3), and the
follower's direct rate is greater than its negative impact on the leader (i.e., C4), the social utility at the RSE for
case 2 increases compared to that at the NSE.

An interesting interpretation arises when comparing C1 with C3 and C2 with C4. These comparisons
indicate that C1-C2 are the dual of C3-C4. In case 1, a higher social utility can be achieved if the increase
of the leader's utility is higher than the decrease of the follower's utility. In contrast, in case 2, a higher
social utility can be achieved if the increase of the follower's utility is higher than the decrease of the leader's
utility. The variations of the utilities of the leader and the follower at the RSE compared to the NSE for cases 1 and
2 are summarized in Table I.

We should note that implementing the above scenarios in practice does not require synchronization
between the leader and the follower. For example, if the follower plays first, the leader chooses its action based
on its information set without considering the follower's action. Then the follower observes the leader's
impact and plays based on its observation. Therefore, the actions of the leader and the follower always
converge to the NSE and the RSE regardless of synchronization between them.



V. Illustrative Example

In this section, we validate the above results in the power control game and simplify C1-C4 based
on channel gains between users. In power control games, the information set for case 1 is
$\mathcal{I}_0^{RSE1} = \{(\mathcal{A}_m, v_m, H_{mn}, H_{mm}, H_{nm}, \Re_m(a_{-m}))_{m \neq n}, \ \forall m \in \mathcal{N}\}$. For case 1, the solution of the
follower's optimization based on the uncertainty is

$$\tilde{f}_1^{*k} = \cdots, \quad \forall k \in \mathcal{K}.$$ (26)

First, to illustrate all the results obtained in Propositions 2 and 4, we simulate a single-carrier power
control game using the simulation parameters of the first row in Table II. Fig. 2 shows the variation of the utilities
of the follower and the leader on the Pareto boundary of the power control game. By increasing $\varepsilon_1$, the
leader's utility increases, as expected from Proposition 2. In contrast, by increasing the value of $\delta_{10}$,
the follower's utility increases and the leader's utility decreases, in line with Proposition 4.

To provide practical insight into C1 and C2, we express these conditions for power control
games only in terms of channel gains. The exact expressions of C1 and C2 for the power control game are

$$\text{C1 for power control games}: \quad \frac{H_{11}^k H_{10}^k a_1^k}{f_1^k (f_1^k + H_{11}^k a_1^k)} < \frac{H_{00}^k}{f_0^k + H_{00}^k a_0^k}, \quad \forall k \in \mathcal{K},$$ (27)

$$\text{C2 for power control games}: \quad \frac{H_{01}^k H_{00}^k a_0^k}{f_0^k (f_0^k + H_{00}^k a_0^k)} > \frac{H_{11}^k}{f_1^k + H_{11}^k a_1^k}, \quad \forall k \in \mathcal{K}.$$ (28)
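Conditions C1 and C2 can be evaluated directly for a single subchannel from the direct rates and negative impacts of the two players. The sketch below assumes the throughput utility of the power control game, with player 0 as the leader and player 1 as the follower; the function name `c1_c2_hold` and the sample channel values are hypothetical.

```python
def c1_c2_hold(H00, H01, H10, H11, a0, a1, s0, s1):
    """Return (C1, C2) for one subchannel; H_nm is the gain from transmitter m to receiver n."""
    f0 = H01 * a1 + s0                              # interference at the leader
    f1 = H10 * a0 + s1                              # interference at the follower
    J0 = H00 / (f0 + H00 * a0)                      # leader's direct rate, Eq. (3)
    J1 = H11 / (f1 + H11 * a1)                      # follower's direct rate, Eq. (3)
    C10 = H11 * H10 * a1 / (f1 * (f1 + H11 * a1))   # |C_10|: leader's impact on follower
    C01 = H00 * H01 * a0 / (f0 * (f0 + H00 * a0))   # |C_01|: follower's impact on leader
    return C10 < J0, J1 < C01

# Hypothetical operating points: same channels, different leader powers.
strong_leader = c1_c2_hold(1.0, 2.0, 0.05, 1.0, a0=10.0, a1=1.0, s0=0.1, s1=0.1)
equal_powers = c1_c2_hold(1.0, 2.0, 0.05, 1.0, a0=1.0, a1=1.0, s0=0.1, s1=0.1)
```

With these sample values both conditions hold when the leader transmits at high power, while C2 fails when the two powers are equal, which illustrates that the conditions depend on the operating point as well as on the channel gains.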
To simplify the above conditions, we consider three scenarios based on the signal-to-interference-plus-noise
ratio (SINR) of the leader and the follower: R1) the high SINR scenario, i.e., $H_{00}^k a_0^k \gg H_{01}^k a_1^k + \sigma_0^k$ and
$H_{11}^k a_1^k \gg H_{10}^k a_0^k + \sigma_1^k$, in which C1 and C2 are simplified to

flft < fl"oi. (29) 

R2) the low SINR scenario, i.e., $H_{00}^k a_0^k \ll H_{01}^k a_1^k + \sigma_0^k$ and $H_{11}^k a_1^k \ll H_{10}^k a_0^k + \sigma_1^k$, where the social utility
increases if

H k m >H k m and < H k , (30) 

and R3) the moderate SINR scenario, i.e., where the values of the induced interference of the leader and the
follower on each other are close, i.e., $f_0^k \approx f_1^k$. Therefore, C1 and C2 change to

zjk zjk r ,k uk zjk r ,k 

H oo > -"n Q i -"n < H oo a o \/k elC 

uk uk r ,k _j_ „k ' uk uk „k _i_ ^_fc ' 

-"01 -^10% + a l H 10 H 01 a l + ^0 

The above conditions are simplified by assuming that channel noise is much less than the interference 
from other users, e.g., H k a^ S> cr k , and the power transmissions of the leader and the follower are close, 
as 

H k H k > fl*£T* VA; € K. (31) 

Using (29)-(31), we can predict how the social utility increases or decreases under given channel
conditions for case 1. To provide insight into implementing the Stackelberg game in interference
channels, we investigate the effect of these conditions on the amount of increase and decrease of the
leader's and the follower's utilities. Let $d_n^{RSE1} = \frac{\omega_n^{*RSE1} - \omega_n^{*NSE}}{\omega_n^{*NSE}} \times 100$ denote the percentage change of the
utility of player $n$ at the RSE for case 1 compared to the NSE, and $d^{RSE1} = \frac{\omega^{*RSE1} - \omega^{*NSE}}{\omega^{*NSE}} \times 100$ the percentage change
of the social utility at the RSE for case 1 compared to the NSE. A larger value of $d_n^{RSE1}$ indicates a larger increase
of the utility of player $n$ in case 1. The same notation is applied to case 2, with $d_n^{RSE2} = \frac{\omega_n^{*RSE2} - \omega_n^{*NSE}}{\omega_n^{*NSE}} \times 100$
and $d^{RSE2} = \frac{\omega^{*RSE2} - \omega^{*NSE}}{\omega^{*NSE}} \times 100$ for user $n$ and the social utility, respectively.
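
The percentage metrics above reduce to a one-line computation. The following Python sketch evaluates it for the R1 utilities reported in Table III. The function name `percentage_change` is ours, and obtaining the social utility by summing the two players' utilities is an assumption of this sketch.

```python
def percentage_change(utility_rse: float, utility_nse: float) -> float:
    """Relative change, in percent, of a utility at the RSE with respect to the NSE."""
    if utility_nse == 0:
        raise ValueError("the utility at the NSE must be nonzero")
    return 100.0 * (utility_rse - utility_nse) / utility_nse


# (RSE, NSE) utilities of the leader and the follower in R1, taken from Table III.
leader_rse, leader_nse = 92.17, 71.95
follower_rse, follower_nse = 53.47, 70.84

d_leader = percentage_change(leader_rse, leader_nse)
d_follower = percentage_change(follower_rse, follower_nse)
# Assumption: the social utility is the sum of the two players' utilities.
d_social = percentage_change(leader_rse + follower_rse, leader_nse + follower_nse)

print(f"leader: {d_leader:.1f}%, follower: {d_follower:.1f}%, social: {d_social:.1f}%")
```

A positive value means that the player (or the system) gains from the robust formulation, and a negative value means that it loses.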

In Figs. 3 (a), (b), and (c), we depict $d_0^{RSE1}$, $d_1^{RSE1}$, and $d^{RSE1}$ for R1, R2, and R3, respectively.
The increase of the leader's utility and the decrease of the follower's utility with respect to $\varepsilon_1$ are evident, as
expected from Proposition 2. When (29)-(31) are satisfied, the social utility increases, as shown in Figs. 3
(a), (b), and (c). In contrast, when (29)-(31) are not satisfied, the social utility decreases compared to the NSE,
as depicted in Figs. 3 (d), (e), and (f). However, in R1, the increase of the social utility is not considerable even if
(29) holds. In R2 and R3, when (30) and (31) are satisfied, the leader's utility and the social utility increase
considerably. For example, when (30) does not hold, $d_0^{RSE1}$ reduces from 220% in Fig. 3 (b) to 40% in
Fig. 3 (e), and $d^{RSE1}$ decreases from 100% to around −1%. Therefore, both an accurate information
set at the leader and the satisfaction of (30) are valuable for the leader and for the system. The same is true for R3.

For case 2, the uncertain parameter is $H_{10}$. For the above regions, C3 and C4 change to

Rl: => H^>H k m (32) 

R2: => Hq < H n > H 10 (33) 

R3: => H$ H* < H^H&k. (34) 

The effects of enlarging the value of $\delta_{10}$ on $d_0^{RSE2}$, $d_1^{RSE2}$, and $d^{RSE2}$ are investigated in Figs. 4 (a)-(f).
As expected from Proposition 4, the leader's utility decreases in all cases compared to that at
the NSE, while the follower's utility increases as the value of $\delta_{10}$ increases. In Figs. 4 (a), (b), and
(c), conditions (32)-(34) do not hold. Consequently, the social utility decreases. In contrast, in Figs. 4
(d), (e), and (f), the social utility increases with $\delta_{10}$ because (32)-(34) hold. Figs. 4 (a) and 4 (d)
show that $d_1^{RSE2}$ decreases from 1.5% to 1% when (32) holds compared to the case where it does not hold. The
same decrease is observable by comparing $d_1^{RSE2}$ in Figs. 4 (b) and 4 (e), and in Figs. 4 (c) and 4 (f). These
comparisons indicate that, in all regions, the follower obtains a lower utility when (32)-(34) hold, and
the social utility increases only slightly.

A. Practical remarks 

Comparing Fig. 3 and Fig. 4 shows that the leader's utility and the social utility increase significantly for
case 1. In contrast, for case 2, the social utility increases at a lower rate, and the follower's utility
increases only slightly. For example, in R2, the leader's utility and the social utility increase by up to
200% in case 1, while the follower's utility and the social utility increase by up to 10% in case 2. Therefore, the
leader's complete information set is more effective in increasing the social utility. This analysis provides
incentives for the system designer to build more efficient protocols for collecting accurate information for
the leader. These results also help in designing systems with two or more leaders. Consider leaders that
want to increase their social utility and that encounter uncertainty in their parameters. If (29)-(31) hold
for one of them and it has complete information, it plays as the leader and the others play as followers.
Therefore, the social utility of the leaders increases considerably.

B. Power control with maximum transmit power constraints 

Consider that the sum of the powers of each player over all subchannels is limited by $P_n^{\max}$, i.e.,

$$\sum_{k=1}^{K} a_n^k \leq P_n^{\max}. \tag{35}$$

In this case, the players' strategies are nonlinear functions of their observations [24].
Therefore, we cannot use Propositions 1-4 directly. To investigate the performance of RSE with (35), we
show the allocated power over different subchannels in Figs. 5-7. The corresponding leader's utility and
follower's utility are summarized in Table III. The simulation parameters are the same as for Figs. 3-4
in Table II, except that $P_n^{\max} = a_{nk}^{\max} = 200$ dB.

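Consider $\mathcal{K}_n^{NSE} \subseteq \mathcal{K}$ and $\mathcal{K}_n^{RSE1} \subseteq \mathcal{K}$ as the sets of subchannels utilized by user $n$ at the NSE and the RSE
for case 1, respectively. We also define $\mathcal{L}_{mn}^{NSE} = \mathcal{K}_m^{NSE} \cap \mathcal{K}_n^{NSE}$ and $\mathcal{L}_{mn}^{RSE1} = \mathcal{K}_m^{RSE1} \cap \mathcal{K}_n^{RSE1}$ as the
sets of common subchannels between user $m$ and user $n$ at the NSE and the RSE for case 1, respectively.
According to the simulation results in Figs. 5-7, we have $|\mathcal{L}_{01}^{RSE1}| < |\mathcal{L}_{01}^{NSE}|$, where $|\mathcal{L}_{01}^{RSE1}|$ and $|\mathcal{L}_{01}^{NSE}|$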
are the size of Cfif E1 and £q 1 se , respectively. For example, l^^ 1 ) = 17 and \C^ SE \ = 13 in Rl 
and \Cqi E1 \ = and |£gj | = 1 in R2. As we expected and seen in Table ITTT1 the leader's utility 
increases for case 1 in all conditions. Interestingly, in R2, the follower's utility also increases compared
to the NSE. Since the number of common subchannels decreases, there is less interference between the leader
and the follower at the RSE for case 1 than at the NSE. Therefore, as shown in Table III, it is possible
for the leader's utility and the follower's utility to increase simultaneously.
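
The sets of utilized and common subchannels defined above can be read directly off a power allocation. The sketch below is a minimal Python illustration. The helper names and the four allocation vectors are hypothetical and are not taken from the simulations behind Figs. 5-7.

```python
def used_subchannels(power, threshold=0.0):
    """Indices (1-based) of the subchannels on which a user transmits."""
    return {k for k, p in enumerate(power, start=1) if p > threshold}


def common_subchannels(power_m, power_n, threshold=0.0):
    """Subchannels used by both users, i.e., the intersection of their used sets."""
    return used_subchannels(power_m, threshold) & used_subchannels(power_n, threshold)


# Hypothetical power allocations over K = 6 subchannels (illustrative values only).
leader_nse = [3.0, 2.0, 0.0, 1.5, 0.0, 2.5]
follower_nse = [1.0, 0.0, 2.0, 1.0, 0.0, 0.5]
leader_rse1 = [3.5, 2.5, 0.0, 2.0, 0.0, 3.0]
follower_rse1 = [0.0, 0.0, 2.5, 0.5, 0.0, 0.0]

common_nse = common_subchannels(leader_nse, follower_nse)
common_rse1 = common_subchannels(leader_rse1, follower_rse1)
print(len(common_rse1), "<", len(common_nse))
```

In this toy example the follower backs off on the subchannels where the leader is active, so the overlap shrinks, which is the mechanism the text uses to explain the reduced mutual interference.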

To quantify this probability, we show the simulated cumulative distribution function (CDF) of $d_1^{RSE1}$
for three high-interference situations between the leader and the follower: S1) the leader's
interference to the follower is high, e.g., the corresponding cross-to-direct channel gain ratio exceeds 0.8,
but the follower's interference to the leader is low, e.g., the corresponding ratio is below 0.1;
S2) the leader's and the follower's interference to each other are both high, e.g., both ratios exceed 0.9;
S3) the follower's interference to the leader is high, e.g., the corresponding ratio exceeds 0.9, but the
leader's interference to the follower is low, e.g., the corresponding ratio is below 0.1.



From the simulated curves in Fig. 8, when both the interference from the leader to the follower and that from
the follower to the leader are large, i.e., in S2, the follower has no chance of reaching a higher utility
than at the NSE. In contrast, in S1 and S3, the follower's utility increases with some probability,
e.g., around 0.1. Therefore, the leader and the follower can both reach higher utilities when there is uncertainty in
the follower's observations under the power constraint in (35), which is an opportunistic phenomenon of the RSG
in case 1 in terms of increasing the utilities of both the leader and the follower.
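
A probability of this kind can be estimated from simulated samples through the empirical CDF. The following Python sketch shows the computation. The sample values are hypothetical and only illustrate the procedure; they are not the data behind Fig. 8.

```python
from bisect import bisect_right


def empirical_cdf(samples):
    """Return a function x -> fraction of samples that are <= x."""
    ordered = sorted(samples)
    n = len(ordered)
    if n == 0:
        raise ValueError("at least one sample is required")

    def cdf(x):
        return bisect_right(ordered, x) / n

    return cdf


def probability_of_gain(samples):
    """Estimated probability that the percentage change is strictly positive."""
    return 1.0 - empirical_cdf(samples)(0.0)


# Hypothetical percentage changes of the follower's utility over ten channel realizations.
d1_samples = [-40.0, -30.0, -25.0, -10.0, -5.0, -2.0, -1.0, -0.5, 0.0, 1.8]
print(f"estimated probability of a gain: {probability_of_gain(d1_samples):.2f}")
```

Reading the CDF at zero and taking the complement gives the probability mass to the right of zero, which is the quantity discussed for S1-S3.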

VI. Extension to Multi-user Game 

The previous results can be extended to multiple leaders and multiple followers in the Stackelberg
game. The analysis of RSE in this scenario is more challenging [25]-[27]. To make it tractable, we
focus on the case where the NE of the followers' game is unique.

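A. One leader and multiple-followers ($N_l = 1$ and $N_f > 1$)

For this scenario, there is only one leader in the Stackelberg game, denoted by the index 0. Consider
the $N_f \times N_f$ matrix $\mathbf{T}$ with the following elements:

$$[\mathbf{T}]_{nm} = \begin{cases} \alpha_n^{\min}, & \text{if } m = n, \\ -\beta_{nm}^{\max}, & \text{if } m \neq n, \end{cases} \qquad m, n \in \mathcal{N}_f,$$

where

$$\alpha_n(\mathbf{a}) = \text{smallest eigenvalue of } -\nabla^2_{\mathbf{a}_n} v_n(\mathbf{a}_n, \mathbf{f}_n), \qquad \alpha_n^{\min} = \inf_{\mathbf{a}} \alpha_n(\mathbf{a}), \qquad \forall n \in \mathcal{N}_f, \tag{36}$$

$$\beta_{nm}(\mathbf{a}) = \| -\nabla_{\mathbf{a}_n \mathbf{a}_m} v_n(\mathbf{a}_n, \mathbf{f}_n) \|, \quad \forall m \neq n, \qquad \beta_{nm}^{\max} = \sup_{\mathbf{a}} \beta_{nm}(\mathbf{a}), \qquad \forall n \in \mathcal{N}_f. \tag{37}$$

If $\mathbf{T}$ is a P-matrix, the NE between the followers is unique (Theorem 12.5, [28]).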
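
For a small number of followers, the P-matrix property (all principal minors strictly positive) can be checked by brute force. The Python sketch below is an illustration under that definition. The helper names and the numerical values of the diagonal and off-diagonal entries are hypothetical, and the enumeration is exponential in $N_f$, so it is only meant for small games.

```python
from itertools import combinations


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [[float(x) for x in row] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign of the determinant
        det *= a[col][col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for j in range(col, n):
                a[row][j] -= factor * a[col][j]
    return det


def is_p_matrix(matrix, tol=1e-12):
    """True if every principal minor of the square matrix is strictly positive."""
    n = len(matrix)
    for size in range(1, n + 1):
        for idx in combinations(range(n), size):
            minor = [[matrix[i][j] for j in idx] for i in idx]
            if determinant(minor) <= tol:
                return False
    return True


def build_t(alpha_min, beta_max):
    """Assemble T: alpha_min[n] on the diagonal and -beta_max[n][m] elsewhere."""
    n = len(alpha_min)
    return [[alpha_min[i] if i == j else -beta_max[i][j] for j in range(n)]
            for i in range(n)]


# Hypothetical values: weak coupling gives a P-matrix, strong coupling does not.
weak = build_t([2.0, 2.0], [[0.0, 1.0], [1.0, 0.0]])
strong = build_t([1.0, 1.0], [[0.0, 2.0], [2.0, 0.0]])
print(is_p_matrix(weak), is_p_matrix(strong))
```

Intuitively, the diagonal entries measure how strongly each follower's utility is concave in its own action, and the off-diagonal entries measure the coupling between followers, so the test passes when the coupling is weak enough.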

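1) Case 1 of RSE: In this case, the followers' observations are uncertain parameters, which are modeled
as in (11). In contrast, the leader has the complete information set. For each follower, the optimization
problem is similar to (13), and the reformulation in (20) can be applied as

$$\tilde{\mathbf{f}}_n = \mathbf{f}_n - \varepsilon_n \boldsymbol{\vartheta}_n, \quad n \in \mathcal{N}_f, \tag{38}$$

where $\tilde{\mathbf{f}}_n = [\tilde{f}_n^1, \cdots, \tilde{f}_n^K]$, $\boldsymbol{\vartheta}_n = [\vartheta_n^1, \cdots, \vartheta_n^K]$, and

$$\vartheta_n^k = \frac{\partial v_n(\mathbf{a}_n, \mathbf{f}_n) / \partial f_n^k}{\sqrt{\sum_{k=1}^{K} \left( \partial v_n(\mathbf{a}_n, \mathbf{f}_n) / \partial f_n^k \right)^2}}. \tag{39}$$
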

Proposition 5: For case 1 of RSG, if $\mathbf{T}$ is a P-matrix, 1) the followers' strategies are decreasing functions
with respect to $\boldsymbol{\varepsilon} = [\varepsilon_1, \cdots, \varepsilon_{N_f}]$ and the social utility of the followers' game is less than that at the NSE;
2) the leader's utility at the RSE is higher than that at the NSE; 3) the social utility increases compared to
the NSE if

$$\text{C5}: J_{a_0}^0 > \sum_{n \in \mathcal{N}_f} C_{n0}, \qquad \text{C6}: J_{a_n}^n < C_{0n} + \sum_{m \neq n} C_{mn}, \quad \forall n \in \mathcal{N}_f.$$

Proof: See Appendix F. ■
As in Proposition 2, the followers' noisy observations increase the leader's utility. In contrast,
they decrease the followers' utilities. The social utility increases at the RSE compared to that at the NSE if the
leader's direct rate is larger than its negative impact on the followers, i.e., C5, and if, for each follower,
its negative impact on the other followers and the leader is greater than its direct rate, i.e., C6.
Under these conditions, decreasing a follower's action increases the leader's utility and the other
followers' utilities by more than it decreases that follower's own utility. Likewise, increasing the
leader's action increases its own utility by more than it decreases the followers' utilities.

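2) Case 2: In this case, we assume that $\mathbf{X}_{n0}$ is an uncertain parameter, and the leader considers the
uncertainty region with bound $\delta_{n0}$ for each follower as

$$\mathcal{R}_{\mathbf{X}_{n0}} = \{ \tilde{\mathbf{X}}_{n0} \mid \|\hat{\mathbf{X}}_{n0}\|_2 = \|\tilde{\mathbf{X}}_{n0} - \mathbf{X}_{n0}\|_2 \leq \delta_{n0} \}, \quad \forall n \in \mathcal{N}_f. \tag{40}$$

Proposition 6: For case 2, if $\mathbf{T}$ is a P-matrix, 1) the leader's utility is always less than that at the NSE;
2) the followers' actions are increasing functions with respect to $\boldsymbol{\delta}_0 = [\delta_{10}, \cdots, \delta_{N_f,0}]$, and the social
utility of the followers' game is higher than that at the NSE; 3) the social utility increases if

$$\text{C7}: J_{a_0}^0 < \sum_{n \in \mathcal{N}_f} C_{n0}, \qquad \text{C8}: J_{a_n}^n > C_{0n} + \sum_{m \neq n} C_{mn}, \quad \forall n \in \mathcal{N}_f.$$

Proof: See Appendix G. ■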
Again, the leader's incomplete information set decreases the leader's utility and increases the social
utility of the followers. Also, from C7-C8, if the leader's direct rate is less than its negative impact on the
followers, and the negative impact of each follower on the other followers and the leader is less than its
direct rate, the social utility at the RSE for case 2 increases compared to the NSE. Besides, C5-C6 and C7-C8
are dual.

For the scenario with multiple followers and one leader, the hierarchy between the leader and the followers
remains. To implement this scenario in a distributed manner, the leader announces its action first. Then,
all the followers play the strategic non-cooperative robust game. The robust game between the followers can
be implemented using a distributed algorithm as in [21].

B. Multiple-leaders and Multiple-followers: $N_l > 1$ and $N_f > 1$

In this scenario, the definitions of RSG and RSE are not unique. There are different forms of NSG
for the multiple-leaders and multiple-followers scenario, depending on the interaction among the leaders, such as
cooperation or competition [26]. For example, one possible way to define the Stackelberg
game with multiple leaders and multiple followers is to consider cooperation between the leaders, where all
leaders know each other and try to maximize their social utility as

$$\max_{\{\mathbf{a}_n\}_{n \in \mathcal{N}_l}} \sum_{n \in \mathcal{N}_l} v_n(\mathbf{a}_n, \mathbf{f}_n) \tag{41}$$

$$\text{subject to} \quad \max_{\mathbf{a}_m \in \mathcal{A}_m} v_m(\mathbf{a}_m, \mathbf{f}_m), \quad \forall m \in \mathcal{N}_f.$$

Proposition 7: If T is a P-matrix for the game of followers, leaders' social utility increases for case 1 
of RSE. 

Proof: See Appendix H. ■ 
In this scenario, the major concern is the analysis of case 2. Since (41) is a non-convex and non-smooth
optimization problem, we cannot derive conditions under which the aggregate of the leaders'
utilities in (41) increases or decreases. Nevertheless, a heuristic protocol to increase the social utility of the followers
and some of the leaders is proposed based on the results of this paper in Section IV. This protocol is
summarized in Table IV. Briefly, in this protocol, the leader with the highest negative impact on the
followers and the other leaders and with the lowest direct rate is chosen to play as the leader, and all the other
leaders and followers are considered as followers. Therefore, the social utility of the game among the
followers and the remaining leaders increases with respect to the leader's incomplete information set,
based on Proposition 6.

In Table V, we evaluate the performance of the heuristic algorithm when there are two leaders and only
one follower in the power control game. At the NSE, both leaders maximize the sum of their utilities based on
(41). When the leaders' information is subject to uncertainty, leader 1 acts as the leader and leader
2 as a follower. In this case, the uncertainty in the leaders' information decreases the utilities of both
leaders, while the utility of the follower increases. However, the social utility of the follower and
leader 2 increases for case 2 of RSE with the heuristic algorithm. Clearly, the above heuristic algorithm
does not guarantee an increase in the leaders' social utility. However, it guarantees that the social utility over
all players except player $n_l$ increases for case 2 of RSE.

VII. Conclusion

In this paper, we introduced the robust Stackelberg equilibrium for communication systems under
different types of uncertainty. We showed that the followers' uncertain information increases the leaders'
utilities while decreasing the followers' utilities, and that the leaders' uncertain information increases the
followers' utilities while decreasing the leaders' utilities. We showed that the social utility in case 1 increases
if the rates of increase of the leaders' utilities are larger than the rates of decrease of the followers' utilities.
In contrast, in case 2, the social utility increases if the rates of decrease of the leaders' utilities are smaller
than the rates of increase of the followers' utilities. For power control games, we simplified these general
conditions based on the channel gains between transmitters and receivers in different SINR scenarios and
provided insights on how to design enhanced protocols for gathering the leaders' information and how to implement
highly efficient systems in the multiple-leaders communication scenario. These results can be extended to
other communication systems to gain insights on how to implement the RSE and NSE.

Appendix A
Proof of Lemma 1

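Since $v_1(\mathbf{a}_1, \mathbf{f}_1)$ is convex with respect to $\mathbf{f}_1$, the internal optimization problem of the follower in (11)
can be solved via the Lagrangian

$$L(\mathbf{a}_1, \tilde{\mathbf{f}}_1, \lambda) = \sum_{k=1}^{K} v_1^k(a_1^k, \tilde{f}_1^k) + \lambda \Big( \sum_{k=1}^{K} (\tilde{f}_1^k - f_1^k)^2 - \varepsilon_1^2 \Big), \tag{42}$$

where $\lambda$ is the nonnegative Lagrange multiplier that satisfies (11), i.e.,

$$\lambda \times \Big( \varepsilon_1^2 - \sum_{k=1}^{K} (\tilde{f}_1^k - f_1^k)^2 \Big) = 0. \tag{43}$$

The solution of (42) with respect to $\tilde{\mathbf{f}}_1$ can be obtained from the optimality condition of the optimization problem
without constraint [29], i.e., $\partial L(\mathbf{a}_1, \tilde{\mathbf{f}}_1, \lambda) / \partial \tilde{\mathbf{f}}_1 = 0$, which is equivalent to

$$\frac{\partial v_1(\mathbf{a}_1, \tilde{\mathbf{f}}_1)}{\partial \tilde{f}_1^k} = -2\lambda \times (\tilde{f}_1^k - f_1^k), \quad \forall k \in \mathcal{K}. \tag{44}$$

If the above solution is inserted in (43), the uncertain parameter is obtained as (19).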
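
For intuition, consider the first-order (linearized) version of this step. If the utility changes by $\mathbf{g}^T \Delta$ for a perturbation $\Delta$ of the observation, where $\mathbf{g}$ is the gradient with respect to the observation, then eliminating $\lambda$ between (43) and (44) gives the worst-case perturbation $\Delta = -\varepsilon_1 \mathbf{g} / \|\mathbf{g}\|$ on the boundary of the uncertainty ball. The Python sketch below checks this numerically against random points of the ball. It is only a sanity check under the linearization assumption, and the gradient values and function names are hypothetical.

```python
import math
import random


def worst_case_shift(gradient, radius):
    """Perturbation of norm `radius` that minimizes the linearized utility change."""
    norm = math.sqrt(sum(g * g for g in gradient))
    if norm == 0.0:
        return [0.0] * len(gradient)
    return [-radius * g / norm for g in gradient]


def linear_change(gradient, shift):
    """First-order change of the utility caused by the perturbation `shift`."""
    return sum(g * s for g, s in zip(gradient, shift))


def random_point_in_ball(dim, radius, rng):
    """Random point whose Euclidean norm does not exceed `radius`."""
    direction = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(d * d for d in direction))
    scale = radius * rng.random() / norm
    return [scale * d for d in direction]


# Hypothetical gradient of the follower's utility with respect to its observation.
# The entries are negative because more interference lowers the utility.
gradient = [-0.5, -0.2, -0.1]
radius = 0.3

shift = worst_case_shift(gradient, radius)
worst_value = linear_change(gradient, shift)

rng = random.Random(0)
sampled = [linear_change(gradient, random_point_in_ball(3, radius, rng))
           for _ in range(1000)]
print(worst_value <= min(sampled))
```

Because the gradient entries are negative, every component of the worst-case shift is positive: under this linearization the follower behaves as if it observed more interference than it actually does.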

Appendix B 
Proof of Proposition 1 

1) At the RSE, the first-order optimality condition holds for the follower's optimization problem as

$$\nabla_{\mathbf{a}_1^{*RSE1}} v_1(\mathbf{a}_1^{*RSE1}, \tilde{\mathbf{f}}_1^{*RSE1}) = 0, \tag{45}$$

where $\tilde{\mathbf{f}}_1^{*RSE1}$ is derived from $\mathbf{a}_0^{*RSE1}$. The derivative of (45) with respect to $\varepsilon_1$ is

TV7 2 „, ( n *RSE\ ptRSEl^ a *RSEl , vy „, (n *RSEl f *RSEl\ V 7 f*RSEl] n /^/-x 

If ei = 0, the uifcf^f^) is equal to Vl (al NSE ,1t NaE ), where f^ 5B is obtained by aj^. 
Also, from (fT91 ), the last term of left hand side of (14"6l ) is equal to — Therefor, by rearrangement of 
(l46l ). we have 

V^a^^d^J-^x^. (47) 

From A1, A2, and A3, the right-hand side of (47) is negative; hence $\nabla_{\varepsilon_1} \mathbf{a}_1^{*RSE1} < 0$. Therefore, the
follower's action is a decreasing function of $\varepsilon_1$. 2) At the RSE for case 1, the first-order
condition holds for the utility of the leader with respect to $\mathbf{a}_0^{*RSE1}$ as

$$\nabla_{\mathbf{a}_0^{*RSE1}} v_0(\mathbf{a}_0^{*RSE1}, \mathbf{f}_0^{*RSE1}) = 0. \tag{48}$$

The derivative of (48) with respect to $\varepsilon_1$ is

\\7 2 n, (c* RSE1 f*RSEl\ w „*RSE1 , 

v a3 flSE1 fs RSi5in o(a ,t J x X i x V £l a 1 J £l=0 = U, (49) 

which is equivalent to 

vy ^RSEl _ / T \-1t0 \- Y7 a *RSEl sc m 
V £i a - _ l J ao,a ) Ja ,fo A 01 V £i a l ■ P U -> 

From A1-A3, the right-hand side of (50) is positive. Therefore, we have $\nabla_{\varepsilon_1} \mathbf{a}_0^{*RSE1} > 0$. Hence, the
leader's action is an increasing function of $\varepsilon_1$.

Appendix C
Proof of Proposition 2

1) The Taylor series of the leader's utility around the uncertain parameter is 

f a *RSEl e*RSE\ _ „, (c .*NSE f*NSE\, /<r n 

u o(a ,l J - fo(a ,t j+ (51) 

r- r/Y T7 „ / n *RSE\ j^-RS-EInnTt-, n *RSEl . „, (n *RSEl f*RS El \ \ \Tv-y „*R5Sli . „ 

£i[(X iV^BSEi'uo(a ,t )) Vei^i + (V aS RSEiuo(a ,t ))) V £l a J £l=0 + o, 

Note that in the following proofs, we consider only the first-order term of the Taylor series and ignore the higher-order terms
because 1) $f_n$ is a linear function of $a_{-n}$ and its higher-order derivatives are equal to zero; 2) $v_n$ is a
concave function of $a_n$ and its second-order derivative is negative; 3) the higher-order terms
are very small because of the higher exponents of $\varepsilon_1$. From A2 and $\nabla_{\varepsilon_1} \mathbf{a}_1^{*RSE1} < 0$, the first part of
(51) is always positive. Also, the second term has only positive elements. Therefore, the utility of the leader
in case 1 is always larger than that at the NSE. By some rearrangements, we have

. ,*RSE1 , *NSE ^ _ // T \Ti-j „*RSE1 , / Y t0 „*RSE1\ (e r^ 

2) Again, we start with the Taylor series of the follower's utility around $\varepsilon_1$ as

tuft* SBl^RSE^ = Vi{a *NSE X NSE )+ (53) 

In this case, the first term of the Taylor series is always negative, since $\nabla_{\mathbf{f}_1} v_1(\mathbf{a}_1, \mathbf{f}_1) < 0$ and $\nabla_{\varepsilon_1} \mathbf{a}_0^{*RSE1} > 0$.
Also, $\nabla_{\mathbf{a}_1} v_1(\mathbf{a}_1, \mathbf{f}_1) > 0$ and $\nabla_{\varepsilon_1} \mathbf{a}_1^{*RSE1} < 0$. Consequently, the second term is negative. Hence, the
follower's utility at the RSE for case 1 is always less than its utility at the NSE. By some rearrangement, we
have

, ,*RSE1 , ,*NSE ^ _ v / (v il „*RSE1 , (t1 „*RSE1\ /<r 4 -> 

u)x -u} x ~ e\ x ((AoiJfJ v £l a + (J a J v ei a i J- P 4 ) 

3) The social utility at the RSE for case 1 increases if

$$\omega_0^{*RSE1} - \omega_0^{*NSE} + \omega_1^{*RSE1} - \omega_1^{*NSE} > 0. \tag{55}$$

Because $\nabla_{\varepsilon_1} \mathbf{a}_1^{*RSE1} < 0$ and $\nabla_{\varepsilon_1} \mathbf{a}_0^{*RSE1} > 0$, (55) leads to

$$J_{a_0}^0 + X_{01} J_{f_1}^1 > 0, \qquad J_{a_1}^1 + X_{10} J_{f_0}^0 < 0. \tag{56}$$

The above conditions are equivalent to C1 and C2.

Appendix D
Proof of Proposition 3

From the incomplete information set, the leader cannot calculate the exact value of $\mathbf{a}_1$. Therefore, the value
of $\mathbf{f}_0$ is uncertain. Consequently, the RSE of case 2 can be considered as an $\epsilon$-Stackelberg strategy space for
case 1 (Definition 4.7 of [1]). Since the leader's utility is continuous (from A1 and A2), there always
exists a positive value, i.e., $q > 0$, such that $\omega_0^{*RSE1} - \omega_0^{*RSE2} < q$ (Property 4.2 of [1]), and this difference
is continuous (Property 4.3 of [1]).

Appendix E 
Proof of Proposition 4 

1) To obtain uj RSE2 — uiq SE , we first derive the V^rse2Si* RSE2 . From the first order optimality 
condition as = at a\ RSE2 , we have 

Jl ia y ri "s^f SE2 + Jl lfl = 0, (57) 

Consequently, we have Vf*RSE2al RSE2 = — (Ji I a I ) -1 Ja 1 f 1 which is always negative. V<5 10 a* E2 is equal 
to 

V7 n *RSE2 V7 n *RSE2 T7 e *RSE2 rKQ \ 

From A5, (58) is always positive. Therefore, the follower's action is an increasing function of $\delta_{10}$. The
Taylor series of $v_1(\mathbf{a}_1^{*RSE2}, \mathbf{f}_1^{*RSE2})$ around $\delta_{10}$ is

X„ H U*RSE2 f*RSE2\ 

„ (ct *RSE2 **RSE2\ _ „ (ct *NSE f*NSE\ , r v r g^Ufl > *1 J ] , 

«i(a ) = v 1 {a ,t 1 ) + diox[ J<5 10 =o,£ 1 =o + °, 

adio 

where 

du l {^ RSE2 Xi RSE2 ) _ 

(Y7 „, ( n *RSE2 f*RSE2\\T v7 a *RSE2y7 f *RSE2 , 

fV7 ,,fa*RSE2 f*RSE2\\Tyj f*RSE2 ((-ry. 

{Vf i RSE2U{a 1 ,t 1 )) V / 5 10 t 1 , (t)U) 

and it is simplified to 

, ,RSE2 , NSE ^ //jl \T T 1 / T l \-l / T l \T\v7 e *RSE2 / An 

From A1-A5, the follower's utility increases with respect to $\delta_{10}$ compared to that at the NSE. 2) We start
with = to derive the S/ a *Rs E 2a^ RSE2 as, 

Ja a V ar «SE2ao fl5£;2 + XqiJ^ = 0. (62)

Also, the Taylor series around the uncertain information is

dSi 



A,, („*RSE2 f *RSE2\ 

„, ( a *RSE2 e *RSE2\ „, /n *NSE f *RSE2\ , x v r ^O^Q 1*0 J ] , ^ 

^o(a ,to ) = v o( a o )*o j + ^iox «5 10 =o + o, (63) 



where 

'*0 '*0 



dv (a* RSE2 f* RSE2 ^ 



dSio 

fV7 „ ( a *RSE2 f*RSE2\\1 V7 n *RSE2 T7 n *RSE2 , 

(V fo wo(a5 fl S£2 ,fo)) T X iV 5loai + o, (64) 
By inserting (l62l and (l58l ) in (l64l . we have 

u; ^ 2 - u>» SE * <5 10 x [-(J^/Ja^foXoida^ao)- 1 + (Jg ) T X i] V 4lo ai , (65) 

which is always negative from A1-A5. Now we want to derive the conditions for increasing the social 
utility. Since V a .RSE2&Q RSE2 x Vs 10 Hq RSE2 < 0, to increase the social utility, the sum of second terms 
in right hand side of (l60l) and the first term in right hand side of (l64l should be negative as 

Ua„l - UfJ x IXioI < 0, (66) 

Since V '$ 10 &\ RSE2 > 0, the sum of first term in right hand side of (l60l and the second term in right hand 
side of (l64l should be positive to increase the social utility as 

Ulj-Ufo! x |Xoi| >0. (67) 

Clearly, (66) and (67) lead to C3 and C4.



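Appendix F
Proof of Proposition 5

Lemma 2: If $\mathbf{T}$ is a P-matrix, the followers' strategies are decreasing functions with respect to
$\boldsymbol{\varepsilon} = [\varepsilon_1, \cdots, \varepsilon_{N_f}]$.

Proof: Assume that the followers' strategies, i.e., $\mathbf{a}_{\mathcal{N}_f} = [\mathbf{a}_1, \cdots, \mathbf{a}_{N_f}]$, are increasing functions of
$\boldsymbol{\varepsilon}$, i.e.,

$$\mathbf{a}_{\mathcal{N}_f}^{RSE1} > \mathbf{a}_{\mathcal{N}_f}^{NSE}. \tag{68}$$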

If T is a P-matrix, J"(ajv>) = (J^ (a n ))^ 1 is strictly monotone ( Corollary 2.6.4 in 11281 ). The 
equilibrium of the followers' game is a solution of VI(UnLi -An x R«, J) EE then, 

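$$(\mathbf{a}_{\mathcal{N}_f}^{RSE1} - \mathbf{a}_{\mathcal{N}_f}^{NSE}) \mathcal{J}(\mathbf{a}_{\mathcal{N}_f}^{NSE}) < 0, \tag{69}$$
$$(\mathbf{a}_{\mathcal{N}_f}^{NSE} - \mathbf{a}_{\mathcal{N}_f}^{RSE1}) \mathcal{J}(\mathbf{a}_{\mathcal{N}_f}^{RSE1}) < 0. \tag{70}$$

By subtracting (69) from (70) and considering (68), we have

$$\mathcal{J}(\mathbf{a}_{\mathcal{N}_f}^{RSE1}) > \mathcal{J}(\mathbf{a}_{\mathcal{N}_f}^{NSE}). \tag{71}$$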

From OS), 



and 



From = - £l x |1 x g! f%ffl x (^f =1 (^iM) 2 )"i which is negative according to A1-A3, 



duiiajJn) = <KKJt) + x dft (72) 
da*; da^ da k n ' 

d73l ) is equal to 

J(a% SE1 )-J(*^ E ) = z<0, (74) 

where $\mathbf{z} = (\mathbf{z}_n)_{n=1}^{N_f}$ and $\mathbf{z}_n = [z_n^1, \cdots, z_n^K]$, which contradicts (71). This contradiction implies that
our assumption is wrong. Therefore, the followers' actions in case 1 of the robust Stackelberg game are
decreasing functions with respect to $\boldsymbol{\varepsilon}$. ■
1) Since the followers' strategies are decreasing functions of $\boldsymbol{\varepsilon}$, the value of $\mathbf{f}_0$ decreases, which implies
$v_0^{RSE1} > v_0^{NSE}$ from A2. Consider the variation of the leader's action with respect to the bounds of the
uncertainty regions of all followers as $\nabla_{\boldsymbol{\varepsilon}} \mathbf{a}_0$. The Taylor series of $v_0^{RSE1}$ around $\boldsymbol{\varepsilon}$ is

v* SE1 » vg SE + [(V e a )X + ?n x [XonJi (V e „a n ) T ] + o. (75) 

71=1 

2) The RNE of the followers in the multiple-followers RSG in Section V belongs to the class of robust additively
coupled games introduced in [21]. Based on Theorem 2 of [21], the social utility of the followers at the
RSE is less than that at the NSE if $\mathbf{T}$ is a P-matrix. The Taylor series around $\boldsymbol{\varepsilon}$ is

Nf 

Vn SEl ~ Vn SE + £n x [(J£ ) T X n0 V £ a + £ X nm V £m a m + (J£ ) T V £ „a n ] + o. (76) 

3) The social utility at the RSE for case 1 increases if the sum of (75) and (76) is positive. Therefore,
the terms multiplied by $\nabla_{\varepsilon_n} \mathbf{a}_0$ should be positive because $\nabla_{\varepsilon_n} \mathbf{a}_0 > 0$. Since $\nabla_{\varepsilon_n} \mathbf{a}_n < 0$, the terms
multiplied by it should be negative. These two conditions lead to C5 and C6 after some rearrangement.

Appendix G
Proof of Proposition 6

Lemma 3: If $\mathbf{T}$ is a P-matrix, the followers' strategies are increasing functions with respect to
$\boldsymbol{\delta}_0 = [\delta_{10}, \cdots, \delta_{N_f,0}]$.

Proof: Assume that the followers' strategies, i.e., $\mathbf{a}_{\mathcal{N}_f} = [\mathbf{a}_1, \cdots, \mathbf{a}_{N_f}]$, are decreasing functions of
$\boldsymbol{\delta}_0$. Therefore, we have $\mathbf{a}_{\mathcal{N}_f}^{RSE2} < \mathbf{a}_{\mathcal{N}_f}^{NSE}$. If $\mathbf{T}$ is a P-matrix, $\mathcal{J}(\mathbf{a}_{\mathcal{N}_f}) = (J_{a_n}^n(\mathbf{a}_n))_{n=1}^{N_f}$ is strictly monotone
(Theorem 12.5 in [28]), and we have



J(*W, E 2 ) < ) (77) 



Now, from the Taylor series of the followers' utilities round 5q, we have 



5< da* ^ffc* a<5 n0 



From A5, the last term of (78) is positive. Therefore,

.7(a£f 2 ) - J($f*) = b > (79) 

where $\mathbf{b} = (\mathbf{b}_n)_{n=1}^{N_f}$ and the $k$-th element of $\mathbf{b}_n$ is equal to the last term of (78). From (79), $\mathcal{J}(\mathbf{a}_{\mathcal{N}_f}^{RSE2}) >
\mathcal{J}(\mathbf{a}_{\mathcal{N}_f}^{NSE})$, which contradicts (77). Therefore, the followers' strategies are increasing functions with
respect to $\boldsymbol{\delta}_0$. ■
1) If the followers' actions are increasing functions of $\boldsymbol{\delta}_0$, the value of $\mathbf{f}_0$ increases. Since $v_0$ is a
decreasing function of $\mathbf{f}_0$, the leader's utility is a decreasing function of $\boldsymbol{\delta}_0$. By using the
Taylor series, we have

Vo SE2 ~ Vo SE + E 5 »° x ((J? ) Tx 0nV,5„ o a n ) + (J° o ) T V 5 „ o a ) + o. (80) 

2) Assume that the followers' utilities are decreasing functions with respect to $\boldsymbol{\delta}_0$. In this case, the
followers' strategies are decreasing functions with respect to $\boldsymbol{\delta}_0$, because $\mathcal{J}$ is strongly monotone
when $\mathbf{T}$ is a P-matrix. However, this contradicts Lemma 3. Therefore, the followers' utilities are
increasing functions of $\boldsymbol{\delta}_0$. Consequently, the social utility of the game converges to a higher
utility. Besides, the Taylor series of each follower's utility around $\delta_{n0}$ is

v* SE2 « v% SE + 5 n0 x [(J f "J T X n0 V 5o „a + (JFJ T ( £ X nm V 5m0 a m ) + (J") T V 5on a n ] +o. (81) 

m^n, m£A// 

3) The social utility at the RSE for case 2 increases compared to that at the NSE if the sum of the second parts of the
Taylor series in (80) and (81) is positive. In this case, $\nabla_{\delta_{n0}} \mathbf{a}_0 < 0$ and $\nabla_{\delta_{n0}} \mathbf{a}_n > 0$; therefore, the terms
related to $\nabla_{\delta_{n0}} \mathbf{a}_0$ should be negative, and the terms related to $\nabla_{\delta_{n0}} \mathbf{a}_n$ should be positive. After some
rearrangement, C7 and C8 are obtained.

Case of robust game | Leader | Follower | The social utility increases if
case 1 | $\omega_0^{RSE1} \geq \omega_0^{NSE}$ | $\omega_1^{RSE1} \leq \omega_1^{NSE}$ | $|J_{a_0}| > |C_{10}|$, $|J_{a_1}| < |C_{01}|$
case 2 | $\omega_0^{RSE2} \leq \omega_0^{NSE}$ | $\omega_1^{RSE2} \geq \omega_1^{NSE}$ | $|J_{a_0}| < |C_{10}|$, $|J_{a_1}| > |C_{01}|$

TABLE I

Summary of Propositions 2 and 4.



Number of Fig | $N_l$ | $N_f$ | $K$ | $H^k$ | $a_{nk}^{\max}$ | $a_{nk}^{\min}$ | $\sigma^k$ for all $k$
2 | 1 | 1 | 1 | A Rayleigh model [3] | 10 dB | | 0.01
3-4 | 1 | 1 | 20 | A four-ray Rayleigh model [3] | 10 dB | | 0.01

TABLE II

Simulation parameters for power control games



Appendix H 
Proof of Proposition 7 

By introducing robustness in the followers' optimization problems, their strategies at the RSE decrease
[Theorem 2 in [21]]. Consequently, the followers' impacts on the leaders decrease, and the leaders'
utilities increase. Therefore, the leaders' social utility increases.



Fig. 1. Illustrative example: power control games in interference channels.

Fig. 2. Variations of utilities of the follower and the leader for case 1 and case 2 of RSG (axis label: throughput of the leader).

Fig. 3. Validating Proposition 2 with the numerical example of power control: (a) in R1 and (29) holds, (b) in R2 and (30)
holds, (c) in R3 and (31) holds, (d) in R1 and (29) does not hold, (e) in R2 and (30) does not hold, (f) in R3 and (31) does not
hold.



Achieved utility for case 1 in different scenarios based on SINR

Player | R1: RSE | R1: NSE | R1: d% | R2: RSE | R2: NSE | R2: d% | R3: RSE | R3: NSE | R3: d%
Leader | 92.17 | 71.95 | 28.1% | 110.11 | 105.71 | 4.2% | 156.01 | 119.89 | 30.13%
Follower | 53.47 | 70.84 | -24.5% | 97.67 | 95.94 | 1.8% | 50.15 | 96.55 | -48.06%

TABLE III

The leader's and the follower's utility subject to (35).

Fig. 4. Validating Proposition 4 with power control games: (a) in R1 and (32) does not hold, (b) in R2 and (33) does not
hold, (c) in R3 and (34) does not hold, (d) in R1 and (32) holds, (e) in R2 and (33) holds, (f) in R3 and (34) holds.

Fig. 5. Power allocation of the follower and the leader over the subchannels in R1 subject to (35): (a) at NSE and (b) at RSE for case 1.

Fig. 6. Power allocation of the follower and the leader over the subchannels in R2 subject to (35): (a) at NSE and (b) at RSE for case 1.

Fig. 7. Power allocation of the follower and the leader over the subchannels in R3 subject to (35): (a) at NSE and (b) at RSE for case 1.

Fig. 8. CDF of $d_1^{RSE1}$ when (35) holds.



Heuristic protocol for multiple-leaders and multiple-followers
Start: $n_l = 1$ and $n_l \in \mathcal{N}_l$,
Consider $\mathcal{N}_f^{new} = \{1, \ldots, n_l - 1, n_l + 1, \ldots, N_l\} \cup \mathcal{N}_f$,
For all the players of the Stackelberg game except leader $n_l$,
    Calculate C7-C8 by considering $n_l = 0$ and $\mathcal{N}_f^{new}$ as the set of followers.
If C7-C8 hold for $n_l$,
    Play RSG with $n_l$ as the leader and the others as followers:
    1) Leader $n_l$ announces its strategy,
    2) All the players of $\mathcal{N}_f^{new}$ play the strategic game,
    3) break.
otherwise $n_l = n_l + 1$,
    continue,
If $n_l = N_l + 1$,
    End.

TABLE IV

Heuristic protocol for case 2 of multiple-leaders and multiple-followers.
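
The selection loop of Table IV can be sketched in a few lines of Python. This is only an illustration of the control flow. The function `select_leader` and the callback `conditions_hold`, which stands in for the evaluation of C7-C8 on a given system, are hypothetical names introduced here.

```python
from typing import Callable, List, Optional, Sequence, Tuple


def select_leader(
    leaders: Sequence[str],
    followers: Sequence[str],
    conditions_hold: Callable[[str, List[str]], bool],
) -> Optional[Tuple[str, List[str]]]:
    """Return the first leader for which C7-C8 hold, with its new follower set.

    `conditions_hold(candidate, new_followers)` is a user-supplied test of C7-C8
    in which `candidate` plays the role of leader 0 and every other player is
    treated as a follower. Returns None if no candidate satisfies the test.
    """
    for candidate in leaders:
        # All remaining leaders join the original followers.
        new_followers = [p for p in leaders if p != candidate] + list(followers)
        if conditions_hold(candidate, new_followers):
            return candidate, new_followers
    return None


# Toy usage: pretend that C7-C8 hold only when "L2" is the leader.
result = select_leader(["L1", "L2"], ["F1"], lambda c, f: c == "L2")
print(result)
```

Once a leader is selected, it announces its strategy and the players in the new follower set play the strategic game, as in steps 1) and 2) of the table.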



Utility | NE | NSE | RSE for heuristic algorithm
Leader 1 | 7.07 | 5.06 | 4.97
Leader 2 | 1.67 | 4.33 | 4.19
Follower | 1.7 | 2.08 | 2.49
Social utility of the leader 2 and the follower | 3.37 | 6.41 | 6.68

TABLE V

Performance of the heuristic protocol.
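
The last row of Table V is the sum of the rows for leader 2 and the follower. The short Python check below recomputes it from the tabulated utilities and also compares the sum of the two leaders' utilities at the NSE and at the RSE. The dictionary layout and function name are ours.

```python
# Utilities copied from Table V.
table_v = {
    "Leader 1": {"NE": 7.07, "NSE": 5.06, "RSE": 4.97},
    "Leader 2": {"NE": 1.67, "NSE": 4.33, "RSE": 4.19},
    "Follower": {"NE": 1.70, "NSE": 2.08, "RSE": 2.49},
}


def social_utility(players, equilibrium):
    """Sum of the utilities of the given players at the given equilibrium."""
    return sum(table_v[p][equilibrium] for p in players)


for eq in ("NE", "NSE", "RSE"):
    print(eq, round(social_utility(["Leader 2", "Follower"], eq), 2))

leaders_nse = social_utility(["Leader 1", "Leader 2"], "NSE")
leaders_rse = social_utility(["Leader 1", "Leader 2"], "RSE")
print(leaders_rse < leaders_nse)
```

The comparison reflects the two statements made about the heuristic: the social utility of leader 2 and the follower goes up at the RSE, while the leaders' own sum goes down.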



